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REMARKS 

I. INTRODUCTION 

Claims 1, 3-5, 11-18, 20 and 21 have been amended. Claim 19 had been 
canceled. Claims 22-24 have been added. Thus, claims 1-18 and 20-24 remain pending in the 
present application. No new matter has been added. In view of the above amendments and 
following remarks, it is respectfully submitted that all of the presently pending claims are 
allowable. 

IL THE 35 U.S.C. 8 112 REJECTIONS SHOULD BE WITHDRAWN 

Claim 1 stands rejected under 35 U.3.C. § 1 12. (See 03/22/06 Office Action, p. 2, 
lines 16-18). Applicant has amended claim 1 to provide sufficient antecedent basis for the 
limitation "audio command." Thus, it is respectfidly submitted that the rejection be withdrawn. 

III. THE 35 U.S.C. S 103(a^ REJECTIONS SHOULD BE WITHDRAWN 

Claims 1-18 and 20-21 stand rejected under 35 U.S.C. § 103(a) as unpatentable 
over U.S. Patent No. 5,999,214 to Inagaki ("Inagaki") in view of Pavlovic et al., "Integration of 
audio/video information for use in a human-computer intelligent interaction," Image processing, 
1997 Proceedings IEEE, ("Pavlovic") in view of U.S. Patent No. 6,154,723 to Cox et al. 
("Cox"). (See Id., p. 3, lines 1-5). 

Amended claim 1 recites, a 'Video display device comprising: a display 
configured to display a primary image and a picture-in-picture image (PIP) overlaying the 
primary image; and a processor operatively coupled to the display and configured to receive a 
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first video data stream for the primary image, to receive a second video data stream for the PIP, 
to recognize an audio command related to a PIP display characteristic, the processor, upon 
recognizing the audio command, activates an image acquisition component that is configured to 
recognize a user hand gesture related to manipulating the PIP display characteristic, the 
processor manipulates the PIP display characteristic according to the audio command and the 
hand gesture" (Emphasis added). 

Inagaki generally relates to video conference system including a video camera for 
imaging participants in a conference, a plotting image input unit for receiving plotting data from 
a participant, and a synthesization unit for synthesizing an image including plotting image data 
being input and video data of the participant who is entering the plotting image data. (See 
Inagaki, Abstract). Specifically, Inagaki describes a system including a voice direction detection 
unit for detecting a speaking individual within a conference of people. (See Id., col. 11, lines 65- 
67). The voice direction detection unit allows for changing the imaging direction of a camera to 
point towards the speaking individual. (See Id., col 12, lines 4-7). 

Pavlovic generally relates to a human-computer intelligent interaction using 
auditory and visual features in a virtual environment. (See Pavlovic, p. 121, Abstract). 
Specifically, Pavlovic describes a system using multimodal integration wherein visual and audio 
feature vectors are combined to create a joint audio-visual feature. (See Id., p. 122, § 2.4). The 
system allows a user to manipulate virtual objects using a set of gestures and spoken commands. 
(See Id., p. 123, §3). 

Cox generally relates a three-dimensional virtual reality system for creation, 
manipulation and editing, wherein the system includes a voice and three dimensional gesture 
input interface. (See Cox, Abstract). Specifically, Cox describes a system including the use of 
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voice commands to trigger input from a spatially-tracked hand-held wand to a virtual director in 
a virtual immersion environment. (See Id., col. 5, lines 10-19). The position and orientation of 
sensors within the hand-held wand are magnetically tracked by the system. (See Id., col. 4, lines 
6-10). Thus, the tracking of the user gesture according to Cox requires the user to be within an 
immersion environment and furthermore, the tracking of the gesture requires the user to operate 
the magnetically tracked wand. 

In contrast to Inagaki, Pavlovic and Cox, the present invention describes a system 
wherein an image acquisition component is activated upon the system's recognition of an audio 
command. (See Specification, p. 1 1, 11. 8-18). By activation of the image acquisition component 
after the audio command, the present invention is able to prevent false activation of the system 
due to an inadvertent hand gesture from the user. In addition, the present invention allows for 
conservation of system resources since the image acquisition component is activated upon 
recognizing the audio command. Thus, the system will not unnecessarily expend resources to 
monitor the user until the user activates the system using the audio command. Finally, the 
present invention relates to the acquisition of a user hand gesture as opposed to requiring the use 
of magnetically tracked wand as described in Cox. 

It is respectfully submitted that disclosures of Inagaki, Pavlovic and Cox fail to 
teach or suggest, either alone or in combination, each of the claim limitations recited in claims 1- 
18, and 21-24. Specifically, neither Inagaki, Pavlovic nor Cox teach or suggest a 'Video display 
device comprising: a display configured to display a primary image and a picture-in-picture 
image (PIP) overlaying the primary image; and a processor operatively coupled to the display 
and configured to receive a first video data stream for the primary image, to receive a second 
video data stream for the PIP, to recognize an audio command related to a PIP display 
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characteristic, the processor, upon recognizing the audio command, activates an image 
acquisition component that is configured to recognize a user hand gesture related to 
manipulating the PIP display characteristic, the processor manipulates the PIP display 
characteristic according to the audio command and the hand gesture," as recited in claim 1. 

Applicant respectfully submits that for at least the reasons stated above, claim 1 of 
the present application is not obvious over Inagaki in view of Pavlovic and in view of Cox, and 
request that the rejection of this claim be withdrawn. As claims 1-10 and 22 depend from, and 
therefore include all the limitations of claim 1, it is hereby submitted that these claims are also 
allowable. 

The Examiner rejected claim 1 1 using similar arguments to the rejection of claim 
1 over Inagaki in view of Pavlovic and in view of Cox. (See 03/22/06 Office Action, p. 8, lines 
3-11). Claim 1 1 recites a *\ . .activating an image acquisition component upon deteraiining the 
received audio command as recognized; receiving a hand gesture from a user. . Therefore, 
Applicant respectfully submits that claim 1 1 is allowable for at least the reasons discussed above 
with regard to claim 1. As claim 12-14 and 23 depends from, and therefore includes all the 
limitations of claim 1 1, it is hereby submitted that these claims are is also allowable. 

The Examiner rejected claim 15 using similar arguments to the rejection of claim 
1 over Inagaki in view of Pavlovic and in view of Cox. (See 03/22/06 Office Action, p. 9, lines 
3-7). Claim 1 5 recites ". . .a program segment for activating an image acquisition component 
upon determining the received audio command as recognized. . Therefore, Applicant 
respectfully submits that claim 1 5 is allowable for at least the reasons discussed above with 
regard to claim 1. As claim 16-18 and 24 depends from, and therefore includes all the 
limitations of claim 15, it is hereby submitted that these claims are is also allowable. 
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The Examiner rejected claim 20 using similar arguments to the rejection of claim 
1 over Inagaki in view of Pavlovic and in view of Cox. (See 03/22/06 Office Action, p. 9, lines 
8-13). Claim 15 recites "...wherein the processor is configured to recognize an audio commands 
is identified as an audio indication to change the PIP display characteristic, upon recognizing the 
audio command the processor activates an image acquisition component.. Therefore, 
Applicant respectfully submits that claim 20 is allowable for at least the reasons discussed above 
with regard to claim 1 . 

The Examiner rejected claim 21 using similar arguments to the rejection of claim 
1 over Inagaki in view of Pavlovic and in view of Cox. (See 03/22/06 Office Action, p. 9, lines 
14-16). Claim 21 recites . .a program segment for activating an image acquisition component 
upon determining the received audio command as recognized. . Therefore, Applicant 
respectfully submits that claim 21 is allowable for at least the reasons discussed above with 
regard to claim 1. 
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CONCLUSION 



In light of the foregoing, Applicant respectfully submits that all of the now 
pending claims are in condition for allowance. All issues raised by the Examiner having been 
addressed. An early and favorable action on the merits is earnestly solicited. 



Please direct all future correspondence to: 

Larry Liberchuk, Esq. 
Senior IP Counsel 

Philips Intellectual Property & Standards 
P.O. Box 3001 

Briarcliff Manor, NY 10510-8001 

Phone: (914) 333-9602 

Fax: (914)332-0615 

Email: larry,liberchuk@philips.com 



Respectfully submitted, 



Dated: June 14, 2006 
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